Learning deep representations for video-based intake gesture detection

Rouast, Philipp V.; Adam, Marc T. P.

Title: Learning deep representations for video-based intake gesture detection
Creator: Rouast, Philipp V.; Adam, Marc T. P.
Relation: IEEE Journal of Biomedical and Health Informatics Vol. 24, Issue 6, p. 1727-1737
Publisher Link: http://dx.doi.org/10.1109/JBHI.2019.2942845
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Resource Type: journal article
Date: 2020
Description: Automatic detection of individual intake gestures during eating occasions has the potential to improve dietary monitoring and support dietary recommendations. Existing studies typically make use of on-body solutions such as inertial and audio sensors, while video is used as ground truth. Intake gesture detection directly based on video has rarely been attempted. In this study, we address this gap and show that deep learning architectures can successfully be applied to the problem of video-based detection of intake gestures. For this purpose, we collect and label video data of eating occasions using 360-degree video of 102 participants. Applying state-of-the-art approaches from video action recognition, our results show that (1) the best model achieves an F1 score of 0.858, (2) appearance features contribute more than motion features, and (3) temporal context in form of multiple video frames is essential for top model performance.
Subject: deep learning; intake gesture detection; dietary monitoring; video-based
Identifier: http://hdl.handle.net/1959.13/1429319
Identifier: uon:38698
Identifier: ISSN:2168-2194
Rights: This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/
Language: eng
Full Text
Reviewed

Hits: 632
Visitors: 714
Downloads: 84

		Thumbnail	File	Description	Size	Format
View Details Download			ATTACHMENT02	Publisher version (open access)	2 MB	Adobe Acrobat PDF	View Details Download